Reconstructing Causal Biological Networks through Active Learning
نویسندگان
چکیده
Reverse-engineering of biological networks is a central problem in systems biology. The use of intervention data, such as gene knockouts or knockdowns, is typically used for teasing apart causal relationships among genes. Under time or resource constraints, one needs to carefully choose which intervention experiments to carry out. Previous approaches for selecting most informative interventions have largely been focused on discrete Bayesian networks. However, continuous Bayesian networks are of great practical interest, especially in the study of complex biological systems and their quantitative properties. In this work, we present an efficient, information-theoretic active learning algorithm for Gaussian Bayesian networks (GBNs), which serve as important models for gene regulatory networks. In addition to providing linear-algebraic insights unique to GBNs, leading to significant runtime improvements, we demonstrate the effectiveness of our method on data simulated with GBNs and the DREAM4 network inference challenge data sets. Our method generally leads to faster recovery of underlying network structure and faster convergence to final distribution of confidence scores over candidate graph structures using the full data, in comparison to random selection of intervention experiments.
منابع مشابه
Structure Learning of Causal Bayesian Networks: A Survey
Causality is a fundamental concept in reasoning. The effectiveness of many reasoning tasks depends on the understanding of the underlying cause-effect relationships. Therefore, the notion of causality has been explored in a wide range of disciplines. Causal discovery, however, was not modeled as a machine learning task until recently. Many learning approaches have recently been developed and ap...
متن کاملSupervised Inference and Reconstruction of Biological Networks
The vast and fast development of computational and statistical methods has increased the number of applications on reconstructing the structure of large-scale biological networks. Technical feasibility of pattern recognition algorithms and the increasing availability of data repositories provide both challenges and opportunities on reconstruction of biological networks. In this paper, I will pr...
متن کاملA Hybrid Symbolic-Statistical Approach to Modeling Metabolic Networks
Biological systems consist of many components and interactions between them. In Systems Biology the principal problem is modeling complex biological systems and reconstructing interactions between their building blocks. Symbolic machine learning approaches have the power to model structured domains and relations among objects. However biological domains require uncertainty handling due to their...
متن کاملA Bayesian Approach to Learning Causal Networks
Whereas acausal Bayesian networks represent probabilistic independence, causal Bayesian networks represent causal relationships. In this paper, we examine Bayesian methods for learning both types of networks. Bayesian methods for learning acausal networks are fairly well developed. These methods often employ assumptions to facilitate the construction of priors, including the assumptions of para...
متن کاملLearning temporal probabilistic causal models from longitudinal data
Medical problems often require the analysis and interpretation of large collections of longitudinal data in terms of a structural model of the underlying physiological behavior. A suitable way to deal with this problem is to identify a temporal causal model that may effectively explain the patterns observed in the data. Here we will concentrate on probabilistic models, that provide a convenient...
متن کامل